Overview

Dataset statistics

Number of variables27
Number of observations296
Missing cells409
Missing cells (%)5.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory75.3 KiB
Average record size in memory260.4 B

Variable types

NUM15
BOOL8
CAT3
DATE1

Reproduction

Analysis started2020-05-05 17:14:44.664786
Analysis finished2020-05-05 17:15:20.078296
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
month is highly correlated with quarter and 1 other fieldsHigh Correlation
quarter is highly correlated with month and 1 other fieldsHigh Correlation
weekofyear is highly correlated with quarter and 1 other fieldsHigh Correlation
meanwd_udsprevisionempresa is highly correlated with meanwd_udsventaHigh Correlation
meanwd_udsventa is highly correlated with meanwd_udsprevisionempresaHigh Correlation
udsstock has 93 (31.4%) missing values Missing
udsventa has 61 (20.6%) missing values Missing
udsprevisionempresa has 82 (27.7%) missing values Missing
roll4wd_udsventa has 50 (16.9%) missing values Missing
meanwd_udsventa has 42 (14.2%) missing values Missing
roll4wd_udsstock has 16 (5.4%) missing values Missing
roll4wd_udsprevisionempresa has 65 (22.0%) missing values Missing
weekday has 42 (14.2%) zeros Zeros
sin_weekday has 42 (14.2%) zeros Zeros

Variables

df_index
Real number (ℝ≥0)

UNIFORM
UNIQUE
Distinct count296
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10643.0
Minimum23
Maximum21263
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum23
5-th percentile1085
Q15333
median10643
Q315953
95-th percentile20201
Maximum21263
Range21240
Interquartile range (IQR)10620

Descriptive statistics

Standard deviation6162.628011
Coefficient of variation (CV)0.5790311013
Kurtosis-1.2
Mean10643
Median Absolute Deviation (MAD)5328
Skewness0
Sum3150328
Variance37977984
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 23. 21263.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1535 1 0.3%
 
19895 1 0.3%
 
20399 1 0.3%
 
2255 1 0.3%
 
9095 1 0.3%
 
18383 1 0.3%
 
18455 1 0.3%
 
18743 1 0.3%
 
1751 1 0.3%
 
8591 1 0.3%
 
Other values (286) 286 96.6%
 
ValueCountFrequency (%) 
23 1 0.3%
 
95 1 0.3%
 
167 1 0.3%
 
239 1 0.3%
 
311 1 0.3%
 
ValueCountFrequency (%) 
21263 1 0.3%
 
21191 1 0.3%
 
21119 1 0.3%
 
21047 1 0.3%
 
20975 1 0.3%
 

fecha
Date

UNIFORM
UNIQUE
Distinct count296
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
Minimum2019-06-05 00:00:00
Maximum2020-03-26 00:00:00
Histogram

producto
Categorical

CONSTANT
REJECTED
Distinct count1
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
32
296
ValueCountFrequency (%) 
32 296 100.0%
 

Length

Max length2
Mean length2
Min length2
ValueCountFrequency (%) 
Decimal_Number 2 100.0%
 
ValueCountFrequency (%) 
Common 2 100.0%
 
ValueCountFrequency (%) 
ASCII 2 100.0%
 

udsstock
Real number (ℝ≥0)

MISSING
Distinct count93
Unique (%)45.8%
Missing93
Missing (%)31.4%
Infinite0
Infinite (%)0.0%
Mean1141.9901477832511
Minimum129.0
Maximum2416.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum129
5-th percentile483.1
Q1858.5
median1124
Q31401.5
95-th percentile1821
Maximum2416
Range2287
Interquartile range (IQR)543

Descriptive statistics

Standard deviation419.4077978
Coefficient of variation (CV)0.3672604345
Kurtosis0.2753359395
Mean1141.990148
Median Absolute Deviation (MAD)328.5513844
Skewness0.3188856787
Sum231824
Variance175902.9009
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1511 6 2.0%
 
878 5 1.7%
 
1111 5 1.7%
 
710 5 1.7%
 
1460 5 1.7%
 
788 5 1.7%
 
1046 5 1.7%
 
1150 4 1.4%
 
840 4 1.4%
 
1279 4 1.4%
 
Other values (83) 155 52.4%
 
(Missing) 93 31.4%
 
ValueCountFrequency (%) 
129 1 0.3%
 
155 1 0.3%
 
323 1 0.3%
 
362 1 0.3%
 
374 2 0.7%
 
ValueCountFrequency (%) 
2416 1 0.3%
 
2274 2 0.7%
 
2219 1 0.3%
 
2157 2 0.7%
 
2080 1 0.3%
 

udsventa
Real number (ℝ≥0)

MISSING
Distinct count90
Unique (%)38.3%
Missing61
Missing (%)20.6%
Infinite0
Infinite (%)0.0%
Mean621.9744680851064
Minimum147.0
Maximum2410.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum147
5-th percentile272
Q1432
median580
Q3738
95-th percentile1082
Maximum2410
Range2263
Interquartile range (IQR)306

Descriptive statistics

Standard deviation292.6176427
Coefficient of variation (CV)0.4704656826
Kurtosis8.042008973
Mean621.9744681
Median Absolute Deviation (MAD)206.7211227
Skewness2.007398432
Sum146164
Variance85625.08482
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
688 8 2.7%
 
472 7 2.4%
 
639 7 2.4%
 
423 6 2.0%
 
442 6 2.0%
 
560 6 2.0%
 
492 6 2.0%
 
708 5 1.7%
 
511 5 1.7%
 
580 5 1.7%
 
Other values (80) 174 58.8%
 
(Missing) 61 20.6%
 
ValueCountFrequency (%) 
147 1 0.3%
 
186 1 0.3%
 
196 1 0.3%
 
206 1 0.3%
 
216 3 1.0%
 
ValueCountFrequency (%) 
2410 1 0.3%
 
1889 1 0.3%
 
1800 1 0.3%
 
1751 1 0.3%
 
1535 1 0.3%
 

udsprevisionempresa
Real number (ℝ≥0)

MISSING
Distinct count195
Unique (%)91.1%
Missing82
Missing (%)27.7%
Infinite0
Infinite (%)0.0%
Mean3088.8271028037384
Minimum51.0
Maximum19542.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum51
5-th percentile467.6
Q11456.25
median2639.5
Q33788
95-th percentile7415.9
Maximum19542
Range19491
Interquartile range (IQR)2331.75

Descriptive statistics

Standard deviation2519.812359
Coefficient of variation (CV)0.8157829087
Kurtosis11.80325765
Mean3088.827103
Median Absolute Deviation (MAD)1702.171849
Skewness2.649602808
Sum661009
Variance6349454.322
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2371 3 1.0%
 
1389 2 0.7%
 
1399 2 0.7%
 
303 2 0.7%
 
3140 2 0.7%
 
1865 2 0.7%
 
3788 2 0.7%
 
3825 2 0.7%
 
4227 2 0.7%
 
129 2 0.7%
 
Other values (185) 193 65.2%
 
(Missing) 82 27.7%
 
ValueCountFrequency (%) 
51 1 0.3%
 
129 2 0.7%
 
159 1 0.3%
 
216 1 0.3%
 
228 1 0.3%
 
ValueCountFrequency (%) 
19542 1 0.3%
 
16054 1 0.3%
 
13299 1 0.3%
 
9635 1 0.3%
 
9414 1 0.3%
 

promo
Boolean

CONSTANT
REJECTED
Distinct count1
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
296
ValueCountFrequency (%) 
0 296 100.0%
 

festivo
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
288
1
 
8
ValueCountFrequency (%) 
0 288 97.3%
 
1 8 2.7%
 

weekday
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.9966216216216215
Minimum0
Maximum6
Zeros42
Zeros (%)14.2%
Memory size2.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q35
95-th percentile6
Maximum6
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation1.997453142
Coefficient of variation (CV)0.6665683542
Kurtosis-1.241520413
Mean2.996621622
Median Absolute Deviation (MAD)1.706560446
Skewness0.004680305814
Sum887
Variance3.989819056
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 5.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3 43 14.5%
 
2 43 14.5%
 
6 42 14.2%
 
5 42 14.2%
 
4 42 14.2%
 
1 42 14.2%
 
0 42 14.2%
 
ValueCountFrequency (%) 
0 42 14.2%
 
1 42 14.2%
 
2 43 14.5%
 
3 43 14.5%
 
4 42 14.2%
 
ValueCountFrequency (%) 
6 42 14.2%
 
5 42 14.2%
 
4 42 14.2%
 
3 43 14.5%
 
2 43 14.5%
 

quarter
Categorical

HIGH CORRELATION
Distinct count4
Unique (%)1.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
4
92
3
92
1
86
2
26
ValueCountFrequency (%) 
4 92 31.1%
 
3 92 31.1%
 
1 86 29.1%
 
2 26 8.8%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

month
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count10
Unique (%)3.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.993243243243243
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum1
5-th percentile1
Q13
median8
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)7

Descriptive statistics

Standard deviation3.667533456
Coefficient of variation (CV)0.5244395666
Kurtosis-1.215710455
Mean6.993243243
Median Absolute Deviation (MAD)3.109751644
Skewness-0.3478227975
Sum2070
Variance13.45080165
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 6.5 11.5 12. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
12 31 10.5%
 
10 31 10.5%
 
8 31 10.5%
 
7 31 10.5%
 
1 31 10.5%
 
11 30 10.1%
 
9 30 10.1%
 
2 29 9.8%
 
6 26 8.8%
 
3 26 8.8%
 
ValueCountFrequency (%) 
1 31 10.5%
 
2 29 9.8%
 
3 26 8.8%
 
6 26 8.8%
 
7 31 10.5%
 
ValueCountFrequency (%) 
12 31 10.5%
 
11 30 10.1%
 
10 31 10.5%
 
9 30 10.1%
 
8 31 10.5%
 

weekofyear
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count43
Unique (%)14.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.469594594594593
Minimum1
Maximum52
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum1
5-th percentile3
Q111
median31
Q342
95-th percentile50
Maximum52
Range51
Interquartile range (IQR)31

Descriptive statistics

Standard deviation15.97664889
Coefficient of variation (CV)0.561182873
Kurtosis-1.229228509
Mean28.46959459
Median Absolute Deviation (MAD)13.65613587
Skewness-0.3266565044
Sum8427
Variance255.2533097
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 12.5 23.5 52. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
52 7 2.4%
 
51 7 2.4%
 
29 7 2.4%
 
28 7 2.4%
 
27 7 2.4%
 
26 7 2.4%
 
25 7 2.4%
 
24 7 2.4%
 
12 7 2.4%
 
11 7 2.4%
 
Other values (33) 226 76.4%
 
ValueCountFrequency (%) 
1 7 2.4%
 
2 7 2.4%
 
3 7 2.4%
 
4 7 2.4%
 
5 7 2.4%
 
ValueCountFrequency (%) 
52 7 2.4%
 
51 7 2.4%
 
50 7 2.4%
 
49 7 2.4%
 
48 7 2.4%
 
Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size424.0 B
True
246
False
50
ValueCountFrequency (%) 
True 246 83.1%
 
False 50 16.9%
 

sin_weekday
Real number (ℝ)

ZEROS
Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.004759498821957385
Minimum-0.9749279121818236
Maximum0.9749279121818236
Zeros42
Zeros (%)14.2%
Memory size2.4 KiB

Quantile statistics

Minimum-0.9749279122
5-th percentile-0.9749279122
Q1-0.7818314825
median0
Q30.7818314825
95-th percentile0.9749279122
Maximum0.9749279122
Range1.949855824
Interquartile range (IQR)1.563662965

Descriptive statistics

Standard deviation0.7086201304
Coefficient of variation (CV)148.8854514
Kurtosis-1.50521649
Mean0.004759498822
Median Absolute Deviation (MAD)0.6270716718
Skewness-0.0106157593
Sum1.408811651
Variance0.5021424891
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.97492791 -0.8783797 0.8783797 0.97492791], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.4338837391 43 14.5%
 
0.9749279122 43 14.5%
 
-0.4338837391 42 14.2%
 
-0.9749279122 42 14.2%
 
-0.7818314825 42 14.2%
 
0.7818314825 42 14.2%
 
0 42 14.2%
 
ValueCountFrequency (%) 
-0.9749279122 42 14.2%
 
-0.7818314825 42 14.2%
 
-0.4338837391 42 14.2%
 
0 42 14.2%
 
0.4338837391 43 14.5%
 
ValueCountFrequency (%) 
0.9749279122 43 14.5%
 
0.7818314825 42 14.2%
 
0.4338837391 43 14.5%
 
0 42 14.2%
 
-0.4338837391 42 14.2%
 

cos_weekday
Real number (ℝ)

Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.0037955736549281846
Minimum-0.9009688679024191
Maximum1.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum-0.9009688679
5-th percentile-0.9009688679
Q1-0.9009688679
median-0.222520934
Q30.6234898019
95-th percentile1
Maximum1
Range1.900968868
Interquartile range (IQR)1.52445867

Descriptive statistics

Standard deviation0.7079619739
Coefficient of variation (CV)-186.5230498
Kurtosis-1.503349059
Mean-0.003795573655
Median Absolute Deviation (MAD)0.6408877408
Skewness0.009053080122
Sum-1.123489802
Variance0.5012101565
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.90096887 -0.90096887 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-0.222520934 43 14.5%
 
-0.9009688679 43 14.5%
 
-0.222520934 42 14.2%
 
-0.9009688679 42 14.2%
 
0.6234898019 42 14.2%
 
1 42 14.2%
 
0.6234898019 42 14.2%
 
ValueCountFrequency (%) 
-0.9009688679 42 14.2%
 
-0.9009688679 43 14.5%
 
-0.222520934 42 14.2%
 
-0.222520934 43 14.5%
 
0.6234898019 42 14.2%
 
ValueCountFrequency (%) 
1 42 14.2%
 
0.6234898019 42 14.2%
 
0.6234898019 42 14.2%
 
-0.222520934 43 14.5%
 
-0.222520934 42 14.2%
 

is_august
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
265
1
 
31
ValueCountFrequency (%) 
0 265 89.5%
 
1 31 10.5%
 

spring
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
291
1
 
5
ValueCountFrequency (%) 
0 291 98.3%
 
1 5 1.7%
 

summer
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
188
1
108
ValueCountFrequency (%) 
0 188 63.5%
 
1 108 36.5%
 

autumn
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
206
1
90
ValueCountFrequency (%) 
0 206 69.6%
 
1 90 30.4%
 

winter
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
200
1
96
ValueCountFrequency (%) 
0 200 67.6%
 
1 96 32.4%
 

stockMissingType
Categorical

Distinct count3
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
203
2
80
1
 
13
ValueCountFrequency (%) 
0 203 68.6%
 
2 80 27.0%
 
1 13 4.4%
 

Length

Max length3
Mean length3
Min length3
ValueCountFrequency (%) 
Decimal_Number 3 75.0%
 
Other_Punctuation 1 25.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

roll4wd_udsventa
Real number (ℝ≥0)

MISSING
Distinct count235
Unique (%)95.5%
Missing50
Missing (%)16.9%
Infinite0
Infinite (%)0.0%
Mean610.2785859465738
Minimum211.71428571428572
Maximum1294.875
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum211.7142857
5-th percentile305.125
Q1484
median589.5
Q3726.15625
95-th percentile969.6
Maximum1294.875
Range1083.160714
Interquartile range (IQR)242.15625

Descriptive statistics

Standard deviation196.3305705
Coefficient of variation (CV)0.3217064715
Kurtosis0.2274704857
Mean610.2785859
Median Absolute Deviation (MAD)152.9350335
Skewness0.5366137587
Sum150128.5321
Variance38545.69292
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
503.875 3 1.0%
 
560 2 0.7%
 
451 2 0.7%
 
913.625 2 0.7%
 
601.125 2 0.7%
 
692 2 0.7%
 
564 2 0.7%
 
545.625 2 0.7%
 
352.5 2 0.7%
 
524.875 2 0.7%
 
Other values (225) 225 76.0%
 
(Missing) 50 16.9%
 
ValueCountFrequency (%) 
211.7142857 1 0.3%
 
226.875 1 0.3%
 
265.25 1 0.3%
 
273 1 0.3%
 
275 1 0.3%
 
ValueCountFrequency (%) 
1294.875 1 0.3%
 
1133 1 0.3%
 
1105.857143 1 0.3%
 
1101.6 1 0.3%
 
1073.571429 1 0.3%
 

meanwd_udsventa
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
Distinct count6
Unique (%)2.4%
Missing42
Missing (%)14.2%
Infinite0
Infinite (%)0.0%
Mean623.4713016078018
Minimum409.2368421052632
Maximum877.1025641025641
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum409.2368421
5-th percentile409.2368421
Q1500.925
median602.5789474
Q3750.3
95-th percentile877.1025641
Maximum877.1025641
Range467.865722
Interquartile range (IQR)249.375

Descriptive statistics

Standard deviation153.9666295
Coefficient of variation (CV)0.246950628
Kurtosis-0.9983662055
Mean623.4713016
Median Absolute Deviation (MAD)126.8200556
Skewness0.3074290608
Sum158361.7106
Variance23705.72299
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
602.5789474 43 14.5%
 
750.3 43 14.5%
 
598.1621622 42 14.2%
 
500.925 42 14.2%
 
409.2368421 42 14.2%
 
877.1025641 42 14.2%
 
(Missing) 42 14.2%
 
ValueCountFrequency (%) 
409.2368421 42 14.2%
 
500.925 42 14.2%
 
598.1621622 42 14.2%
 
602.5789474 43 14.5%
 
750.3 43 14.5%
 
ValueCountFrequency (%) 
877.1025641 42 14.2%
 
750.3 43 14.5%
 
602.5789474 43 14.5%
 
598.1621622 42 14.2%
 
500.925 42 14.2%
 

roll4wd_udsstock
Real number (ℝ≥0)

MISSING
Distinct count240
Unique (%)85.7%
Missing16
Missing (%)5.4%
Infinite0
Infinite (%)0.0%
Mean1154.8951913265305
Minimum362.0
Maximum2416.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum362
5-th percentile593.16
Q1899.35625
median1130.964286
Q31390.325
95-th percentile1785.471429
Maximum2416
Range2054
Interquartile range (IQR)490.96875

Descriptive statistics

Standard deviation368.2275653
Coefficient of variation (CV)0.3188406776
Kurtosis0.2908491158
Mean1154.895191
Median Absolute Deviation (MAD)289.5266522
Skewness0.4576719462
Sum323370.6536
Variance135591.5399
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
710 10 3.4%
 
1072 5 1.7%
 
1279 4 1.4%
 
1382 3 1.0%
 
2157 3 1.0%
 
1414.625 2 0.7%
 
1460 2 0.7%
 
1925 2 0.7%
 
1159.25 2 0.7%
 
947.625 2 0.7%
 
Other values (230) 245 82.8%
 
(Missing) 16 5.4%
 
ValueCountFrequency (%) 
362 1 0.3%
 
452.25 1 0.3%
 
462 1 0.3%
 
468.25 1 0.3%
 
478 1 0.3%
 
ValueCountFrequency (%) 
2416 1 0.3%
 
2219 1 0.3%
 
2157 3 1.0%
 
2093 1 0.3%
 
2042 1 0.3%
 

meanwd_udsstock
Real number (ℝ≥0)

Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1126.9416993751045
Minimum866.375
Maximum1385.392857142857
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum866.375
5-th percentile866.375
Q1886.6923077
median1113.433333
Q31366.7
95-th percentile1385.392857
Maximum1385.392857
Range519.0178571
Interquartile range (IQR)480.0076923

Descriptive statistics

Standard deviation198.2898471
Coefficient of variation (CV)0.1759539533
Kurtosis-1.518103355
Mean1126.941699
Median Absolute Deviation (MAD)176.0370739
Skewness0.003869696391
Sum333574.743
Variance39318.86348
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 866.375 876.53365385 1376.04642857 1385.39285714], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1366.7 43 14.5%
 
1113.433333 43 14.5%
 
866.375 42 14.2%
 
1021.266667 42 14.2%
 
886.6923077 42 14.2%
 
1243.344828 42 14.2%
 
1385.392857 42 14.2%
 
ValueCountFrequency (%) 
866.375 42 14.2%
 
886.6923077 42 14.2%
 
1021.266667 42 14.2%
 
1113.433333 43 14.5%
 
1243.344828 42 14.2%
 
ValueCountFrequency (%) 
1385.392857 42 14.2%
 
1366.7 43 14.5%
 
1243.344828 42 14.2%
 
1113.433333 43 14.5%
 
1021.266667 42 14.2%
 

roll4wd_udsprevisionempresa
Real number (ℝ≥0)

MISSING
Distinct count229
Unique (%)99.1%
Missing65
Missing (%)22.0%
Infinite0
Infinite (%)0.0%
Mean3171.4648577612866
Minimum51.0
Maximum19542.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum51
5-th percentile280.1875
Q11636.4375
median2602.75
Q33706.25
95-th percentile7442.625
Maximum19542
Range19491
Interquartile range (IQR)2069.8125

Descriptive statistics

Standard deviation2660.373793
Coefficient of variation (CV)0.8388470037
Kurtosis10.90249732
Mean3171.464858
Median Absolute Deviation (MAD)1703.435039
Skewness2.750750737
Sum732608.3821
Variance7077588.721
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
216 2 0.7%
 
129 2 0.7%
 
4023 1 0.3%
 
2516 1 0.3%
 
1811 1 0.3%
 
2940 1 0.3%
 
2882.25 1 0.3%
 
1914.571429 1 0.3%
 
1408.875 1 0.3%
 
1678 1 0.3%
 
Other values (219) 219 74.0%
 
(Missing) 65 22.0%
 
ValueCountFrequency (%) 
51 1 0.3%
 
100 1 0.3%
 
129 2 0.7%
 
153.8571429 1 0.3%
 
159 1 0.3%
 
ValueCountFrequency (%) 
19542 1 0.3%
 
16054 1 0.3%
 
15398.75 1 0.3%
 
13299 1 0.3%
 
13239.25 1 0.3%
 

meanwd_udsprevisionempresa
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2516.6168657385765
Minimum216.0
Maximum5358.631578947368
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum216
5-th percentile216
Q11057.454545
median2249.763158
Q33787.487179
95-th percentile5358.631579
Maximum5358.631579
Range5142.631579
Interquartile range (IQR)2730.032634

Descriptive statistics

Standard deviation1577.100626
Coefficient of variation (CV)0.6266749014
Kurtosis-0.6704399884
Mean2516.616866
Median Absolute Deviation (MAD)1257.793139
Skewness0.3548348885
Sum744918.5923
Variance2487246.385
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 216. 636.72727273 2180.39473684 2524.36842105 5358.63157895], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2798.973684 43 14.5%
 
3787.487179 43 14.5%
 
1057.454545 42 14.2%
 
2111.026316 42 14.2%
 
2249.763158 42 14.2%
 
5358.631579 42 14.2%
 
216 42 14.2%
 
ValueCountFrequency (%) 
216 42 14.2%
 
1057.454545 42 14.2%
 
2111.026316 42 14.2%
 
2249.763158 42 14.2%
 
2798.973684 43 14.5%
 
ValueCountFrequency (%) 
5358.631579 42 14.2%
 
3787.487179 43 14.5%
 
2798.973684 43 14.5%
 
2249.763158 42 14.2%
 
2111.026316 42 14.2%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

df_indexfechaproductoudsstockudsventaudsprevisionempresapromofestivoweekdayquartermonthweekofyearworking_daysin_weekdaycos_weekdayis_augustspringsummerautumnwinterstockMissingTyperoll4wd_udsventameanwd_udsventaroll4wd_udsstockmeanwd_udsstockroll4wd_udsprevisionempresameanwd_udsprevisionempresa
0232019-06-0532478.0560.013299.00.00.022623True0.974928-0.222521001000.0560.0602.578947478.01113.43333313299.002798.973684
1952019-06-0632NaN806.019542.00.00.032623True0.433884-0.900969001002.0806.0750.300000NaN1366.70000019542.003787.487179
21672019-06-0732NaN856.016054.00.00.042623True-0.433884-0.900969001002.0856.0877.102564NaN1243.34482816054.005358.631579
32392019-06-0832NaN275.03572.00.00.052623True-0.974928-0.222521001002.0275.0409.236842NaN1385.3928573572.001057.454545
43112019-06-0932NaNNaNNaN0.00.062623False-0.7818310.623490001002.0NaNNaNNaN866.375000NaN216.000000
53832019-06-1032NaN492.04646.00.00.002624True0.0000001.000000001002.0492.0598.162162NaN886.6923084646.002249.763158
64552019-06-11322157.0570.03438.00.00.012624True0.7818310.623490001000.0570.0500.9250002157.01021.2666673438.002111.026316
75272019-06-12321834.0688.01448.00.00.022624True0.974928-0.222521001000.0592.0602.578947817.01113.43333310336.252798.973684
85992019-06-1332362.0590.02969.00.00.032624True0.433884-0.900969001000.0752.0750.300000362.01366.70000015398.753787.487179
96712019-06-14321382.01092.04795.00.00.042624True-0.433884-0.900969001000.0915.0877.1025641382.01243.34482813239.255358.631579

Last rows

df_indexfechaproductoudsstockudsventaudsprevisionempresapromofestivoweekdayquartermonthweekofyearworking_daysin_weekdaycos_weekdayis_augustspringsummerautumnwinterstockMissingTyperoll4wd_udsventameanwd_udsventaroll4wd_udsstockmeanwd_udsstockroll4wd_udsprevisionempresameanwd_udsprevisionempresa
286206152020-03-1732NaN1800.01810.00.00.011312True0.7818310.623490000012.0554.125000500.925000871.4285711021.2666672910.1252111.026316
287206872020-03-1832NaN600.03089.00.00.021312True0.974928-0.222521000012.0490.250000602.5789471178.7500001113.4333333965.0002798.973684
288207592020-03-1932NaN1889.03140.00.00.031312True0.433884-0.900969000012.0727.750000750.3000001460.0000001366.7000004681.3753787.487179
289208312020-03-2032581.02410.03776.00.00.041312True-0.433884-0.900969000010.01294.875000877.102564839.5000001243.3448287687.5005358.631579
290209032020-03-2132788.0403.0NaN0.00.051312True-0.974928-0.222521000010.0852.000000409.236842837.0000001385.392857NaN1057.454545
291209752020-03-22321137.0NaNNaN0.00.061312False-0.7818310.623490010010.0NaNNaN593.800000866.375000NaN216.000000
292210472020-03-23321137.0NaN1389.00.00.001313True0.0000001.000000010010.0565.857143598.162162593.800000886.6923082296.7502249.763158
293211192020-03-2432384.0NaN452.00.00.011313True0.7818310.623490010010.0929.857143500.925000770.0000001021.2666672238.1252111.026316
294211912020-03-2532905.0NaN1818.00.00.021313True0.974928-0.222521010010.0550.571429602.5789471072.5000001113.4333333365.0002798.973684
295212632020-03-26321279.0NaN708.00.00.031313True0.433884-0.900969010010.01063.857143750.3000001279.0000001366.7000003582.0003787.487179